On-Shelf Utility Mining of Sequence Data
نویسندگان
چکیده
Utility mining has emerged as an important and interesting topic owing to its wide application considerable popularity. However, conventional utility methods have a bias toward items that longer on-shelf time they greater chance generate high utility. To eliminate the bias, problem of (OSUM) is introduced. In this article, we focus on task OSUM sequence data, where sequential database divided into several partitions according periods are associated with utilities periods. address problem, propose two methods, data (OSUMS) OSUMS + , extract high-utility patterns. For further efficiency, also design strategies reduce search space avoid redundant calculation upper bounds prefix extension ( TPEU ) reduced TRSU ). addition, novel structures developed for facilitating utilities. Substantial experimental results certain real synthetic datasets show outperform state-of-the-art algorithm. conclusion, may consume large amount memory unsuitable cases limited memory, while wider real-life applications efficiency.
منابع مشابه
the clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
Sequence data mining
Many interesting real-life mining applications rely on modeling data as sequences of discrete multi-attribute records. Existing literature on sequence mining is partitioned on application-specific boundaries. In this article we distill the basic operations and techniques that are common to these applications. These include conventional mining operations like classification and clustering and se...
متن کاملdata mining rules and classification methods in insurance: the case of collision insurance
assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...
15 صفحه اولA Three-Scan Mining Algorithm for High On-Shelf Utility Itemsets
In this paper, we introduce a new kind of patterns, named high on-shelf utility itemsets, which consider not only individual profit and quantity of each item in a transaction but also on-shelf periods in a database. We have thus proposed a 3-scan mining algorithm to efficiently discover the itemsets. The proposed approach adopts a new pruning strategy and an itemset-generation mechanism to prun...
متن کاملMining high on-shelf utility itemsets with negative values from dynamic updated database
Utility mining emerged to overcome the limitations of frequent itemset mining by considering the utility of an item. Utility of an item is based on user’s interest or preference. Recently, temporal data mining has become a core technical data processing technique to deal with changing data. On-shelf utility mining considers on-shelf time period of item and gets the accurate utility values of it...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Knowledge Discovery From Data
سال: 2021
ISSN: ['1556-472X', '1556-4681']
DOI: https://doi.org/10.1145/3457570